计算机与现代化 ›› 2010, Vol. 1 ›› Issue (6): 17-0020.doi: 10.3969/j.issn.1006-2475.2010.06.006

• 算法设计与分析 • 上一篇    下一篇

一种基于分布式数据库的全局频繁项挖掘算法

邓丽君   

  1. 上海海事大学信息工程学院,上海 200135
  • 收稿日期:2010-02-03 修回日期:1900-01-01 出版日期:2010-07-01 发布日期:2010-07-01

An Algorithm for Mining Global Frequent Item-sets Based on Distributed Database

DENG Li-jun   

  1. College of Information Science and Engineering, Shanghai Maritime University, Shanghai 200135, China
  • Received:2010-02-03 Revised:1900-01-01 Online:2010-07-01 Published:2010-07-01

摘要: 针对FDM(Fast Distributed Association Rules Mining)可能会出现全局频繁项集丢失的问题,引入双射空间的思想和标识位,提出一种DS-CFDM(Double spaceFast Distributed association rules Mining based on Center)算法,算法实现通过一中心站点来协调信息的传输以此来减少网络通信的负担。

关键词: 分布式数据库, 全局频繁项, 数据挖掘, FDM, DSCFDM

Abstract: Aiming at the Fast Distributed association rules Mining (FDM) algorithm which perhaps
loses some global frequent K-item sets, an improved algorithm, named as DS-CFDM, is presented
to solve the problems of FDM. The improved algorithm introduces the ideas of dual-space and
identification bit. In order to reduce the burden of network, a center site is used to coordinate
the information transmission during the implementation of the algorithm.

Key words: distributed database, global frequent item-sets, data mining, FDM, DS-CFDM

中图分类号: